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. Abstract 

In this paper we prove that for any infinite word w whose set of factors is closed under 
reversal, the following conditions are equivalent: 

(I) all complete returns to palindromes are palindromes; 

(II) V{n) + V(n + 1) = C{n + 1) - C(n) + 2 for all n, 

where V (resp. C) denotes the palindromic complexity (resp. factor complexity) function of w, 
• which counts the number of distinct palindromic factors (resp. factors) of each length in w. 
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Pj ■ 1 Introduction 

m 

Given an infinite word w, let V(n) (resp. C{n)) denote the palindromic complexity (resp. factor 
complexity) of w, i.e., the number of distinct palindromic factors (resp. factors) of w of length n. 
In pp, J. -P. Allouche, M. Baake, J. Cassaigne, and D. Damanik established the following inequality 
' relating the palindromic and factor complexities of a non-ultimately periodic infinite word: 
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V(n) < —C(n + 
n y 



for all n G N. 



More recently, using Rauzy graphs, P. Balazi, Z. Masakova, and E. Pelantova [5] proved that for 
any uniformly recurrent infinite word whose set of factors is closed under reversal, 

V{n) + V(n + 1) < C(n + 1) - C(n) + 2 for all n G N. (1.1) 

They also provided several examples of infinite words for which V(n) + V(n + 1) always reaches 
the upper bound given in relation (jl.lj) . Such infinite words include Arnoux-Rauzy sequences, 



*Dipartimento di Matematica e Applicazoni "R. Caccioppoli", Universita degli Studi di Napoli Federico II, Via 
Cintia, Monte S. Angelo, 1-80126, Napoli, ITALY (micbucci@unina.it). 

^Dipartimento di Matematica e Applicazoni "R. Caccioppoli", Universita degli Studi di Napoli Federico II, Via 
Cintia, Monte S. Angelo, 1-80126, Napoli, ITALY (alessandro.deluca@unina.it). 

* Corresponding author: LaCIM, Universite du Quebec a Montreal, CP. 8888, succursale Centre-ville, Montreal, 
Quebec, H3C 3P8, CANADA (amy.glen@gmail.com). Supported by CRM, ISM, and LaCIM. 

§ Institut Camille Jordan, Universite Claude Bernard Lyon 1, 43 boulevard du 11 novembre 1918, 69622 Villeur- 
banne Cedex FRANCE (luca@unt.edu). 



1 



complementation-symmetric sequences, certain words associated with /3-expansions where /3 is a 
simple Parry number, and a class of words coding r-interval exchange transformations. 

In this paper we give a characterization of all infinite words with factors closed under reversal 
for which the equality V(n) + V{n + 1) = C{n + 1) — C{n) + 2 holds for all n: these are exactly the 
infinite words with the property that all 'complete returns' to palindromes are palindromes. Given 
a finite or infinite word w and a factor ji of w, we say that a factor r of w is a complete return to u 
in w if r contains exactly two occurrences of u, one as a prefix and one as a suffix. Return words 
play an important role in the study of minimal subshifts; see [12] [T3J, HU HU [20], [24] . 

Our main theorem is the following: 

Theorem 1.1. For any infinite word w whose set of factors is closed under reversal, the following 
conditions are equivalent: 

(I) all complete returns to any palindromic factor of w are palindromes; 

(II) V(n) + V(n + 1) = C(n + 1) - C(n) + 2 for all n G N. 

Recently, in [19J, it was shown that property (I) is equivalent to every factor u of w having 
exactly \u\ + 1 distinct palindromic factors (including the empty word). Such words are 'rich' in 
palindromes in the sense that they contain the maximum number of different palindromic factors. 
Indeed, X. Droubay, J. Justin, and G. Pirillo |10j observed that any finite word w of length \w\ 
contains at most \w\ + 1 distinct palindromes. 

The family of finite and infinite words having property (I) are called rich words in [19]. In 
independent work, P. Ambroz, C. Frougny, Z. Masakova, and E. Pelantova [2] have considered 
the same class of words which they call full words, following earlier work of S. Brlek, S. Hamel, 
M. Nivat, and C. Reutenauer in [6]. 

Rich words encompass the well-known family of episturmian words originally introduced by 
X. Droubay, J. Justin, and G. Pirillo in |10j (see SectionHJfor more details). Another special class of 
rich words consists of S. Fischler's sequences with "abundant palindromic prefixes", which were intro- 
duced and studied in [16J in relation to Diophantine approximation (see also [17]). Other examples 
of rich words that are neither episturmian nor of "Fischler type" include: non-recurrent rich words, 
like abbbb ■ ■ ■ and abaabaaabaaaab • • ■ ; the periodic rich infinite words: (aab k aabab)(aab k aabab) ■ ■ ■ , 
with k > 0; the non-ultimately periodic recurrent rich infinite word ip(f) where f = abaababaaba ■ ■ ■ 
is the Fibonacci word and ip is the morphism: a i— > aab k aabab, b i— > bob; and the recurrent, but not 
uniformly recurrent, rich infinite word generated by the morphism: a \— > aba, b i— > bb. (See |19| for 
these examples and more.) 

From the work in [10] 119]. we have the following equivalences. 

Proposition 1.2. A finite or infinite word w is rich if equivalently: 

• all complete returns to any palindromic factor of w are palindromes; 

• every factor u of w contains \u\ + 1 distinct palindomes; 

• the longest palindromic suffix of any prefix p of w occurs exactly once in p. 

From the perspective of richness, our main theorem can be viewed as a characterization of 
recurrent rich infinite words since any rich infinite word is recurrent if and only if its set of factors 
is closed under reversal (see [19J or Remark 12. ip . Interestingly, the proof of Theorem 11.11 relies 
upon another new characterization of rich words (Proposition I2.3[) , which is useful for establishing 
the key step, namely that the so-called super reduced Rauzy graph is a tree. This answers a claim 
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made in the last few lines of [5] where it was remarked that the Rauzy graphs of words satisfying 
equality (II) must have a very special form. 

After some preliminary definitions and results in the next section, Section 3 is devoted to the 
proof of Theorem 11.11 and some interesting consequences are proved in Section 4. 

2 Preliminaries 

2.1 Notation and terminology 

In this paper, all words are taken over a finite alphabet A, i.e., a finite non-empty set of symbols 
called letters. A finite word over A is a finite sequence of letters from A. The empty word e is 
the empty sequence. A (right) infinite word x is a sequence indexed by N+ with values in A, i.e., 
x = X1X2X3 ■ ■ ■ with each X{ £ A. For easier reading, infinite words are hereafter typed in boldface 
to distinguish them from finite words. 

Given a finite word w = x\x<i ■ ■ ■ x m (where each xi is a letter), the length of w, denoted by \w\, 
is equal to m. By convention, the empy word is the unique word of length 0. We denote by w the 
reversal of w, given by w = x m ■ ■ ■ x^x\. If w = w, then w is called a palindrome. 

A finite word z is a factor of a finite or infinite word w if w = uzv for some words u, v. In the 
special case u = e (resp. v = e), we call z a prefix (resp. suffix) oiw. If u 7^ e and w/e, then we say 
that z is an interior factor of w = uzv. Moreover, z is said to be a central factor of w if \u\ = \v\. 
We say that z is unioccurrent in w if z occurs exactly once in w. For any finite or infinite word w, 
the set of all factors of w is denoted by F(w) and we denote by F n (w) the set of all factors of w of 
length n, i.e., F n (w) := F(w) nA n (where \w\ > n if w is finite). We say that F(w) is closed under 
reversal if for any u € F(w), u € F(w). 

A factor of an infinite word w is recurrent in w if it occurs infinitely often in w, and w itself is 
said to be recurrent if all of its factors are recurrent in it. Furthermore, w is uniformly recurrent if 
any factor of w occurs infinitely many times in w with bounded gaps. 

Remark 2.1. A noteworthy fact (proved in [19]) is that a rich infinite word is recurrent if and 
only if its set of factors is closed under reversal. 

More generally, we have the following well-known result: 

Proposition 2.2 (folklore). If w is an infinite word with F(w) closed under reversal, then w is 
recurrent. 

Proof. Consider some occurrence of a factor u in w and let v be a prefix of w containing u. As 
-F(w) is closed under reversal, v € F(yv). Thus, if v is long enough, there is an occurrence of u 
strictly on the right of this particular occurrence of u in w. Similarly u occurs on the right of this 
u and thus u is recurrent in w. □ 

2.2 Key results 

We now prove two useful results, the first being a new characterization of rich words. 

Proposition 2.3. A finite or infinite word w is rich if and only if, for each factor v E F(w), any 
factor of w beginning with v and ending with v and not containing v or v as an interior factor is 
a palindrome. 
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Proof. ONLY IF: Consider any factor v G F(w) and let u be a factor of w beginning with v and 
ending with v and not containing v or v as an interior factor. If v is a palindrome, then either 
u = v = v (in which case u is clearly a palindrome), or u is a complete return to i; in w, and hence 
u is (again) a palindrome by Proposition 11.21 Now assume that v is not a palindrome. 

Suppose by way of contradiction that u is not a palindrome and let p be the longest palindromic 
suffix of u (which is unioccurrent in u by richness). Then \p\ < \u\ as u is not a palindrome. If 
\p\ > \v\, then v is a proper suffix of p, and hence v is a proper prefix of p. But then v is an interior 
factor of u, a contradiction. On the other hand, if \p\ < \v\, then \p\ ^ \v\ and p is a proper suffix 
of v (as S is not a palindrome), and hence p is a proper prefix of v. Thus p is both a prefix and a 
suffix of u; in particular p is not unioccurrent in it, a contradiction. 

IF: The given conditions tell us that any complete return to a palindromic factor v (= v) of w 
is a palindrome. Hence w is rich by Proposition 11.21 □ 

Proposition 2.4. Suppose w is a rich word. Then, for any non-palindromic factor v of w, v is a 
unioccurrent factor of any complete return to v in w. 

Proof. Let r be a complete return to v in w and let p be the longest palindromic suffix of r. Then 
\p\ > \v\; otherwise, if \p\ < \v\, then p would occur at least twice in r (as a suffix of each of the two 
occurrences of u in r), which is impossible as r is rich. Thus v is a proper suffix of p, and hence v 
is a proper prefix of p. So v is clearly an interior factor of r. 

It remains to show that v is unioccurrent in r. Arguing by contradiction, we suppose that v 
occurs more than once in r. Then a complete return r' to v occurs as a proper factor of r. Using the 
same reasoning as above, v is an interior factor of r', and hence an interior factor of r, contradicting 
the fact that r is a complete return to v. Thus v is unioccurent in r. □ 

Note. The above proposition tells us that for any factor v of a rich word w, occurrences of v and v 
alternate in w. 



Following the method of Balazi et al. [5] , a key tool for the proof of our main theorem is the notion 
of a Rauzy graph, defined as follows. Given an infinite word w, the Rauzy graph of order n for w, 
denoted by r n (w), is the directed graph with set of vertices -F n (w) and set of edges F n+ %(w) such 
that an edge e G F n+ \ (w) starts at vertex v and ends at a vertex v' if and only if v is a prefix of 
e and v' is a suffix of e. For a vertex v, the out-degree of v (denoted by deg + (t>)) is the number 
of distinct edges leaving v, and the in- degree of v (denoted by deg~(v)) is the number of distinct 
edges entering v. More precisely: 

deg + (v) = [J{x G A | vx G F n+ i(w)} and deg~(t;) = [J{x G A \ xv G F n+ i(w)}. 

We observe that, for all n G N, 



3 Proof of Theorem 11.11 



deg + (v) 



JJF n+1 (w)= J2 



deg (u). 



i)GF„(w) 



«eF„(w) 



(Note that flF n+ i(w) = C(n + 1).) Hence 




(deg (v) - 1). 




«SF„(w) 
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It is therefore easy to see that a factor v G F n (~w) positively contributes to C{n + 1) — C(n) if and 
only if deg + (v) > 2, i.e., if and only if there exist at least two distinct letters a, b such that va, 
vb G F n+ i(w), in which case v is said to be a right-special factor of w. Similarly, a factor v G F n (w) 
is said to be a left-special factor of w if there exist at least two distinct letters a, b such that av, 
bv G F n+ i(w). A factor of w is said to be special if it is either left-special or right-special (not 
necessarily both). With this terminology, if we let S n (w) denote the set of special factors of w of 
length n, then formula (|3.ip may be expressed as: 



Using similar terminology to that in [5], a directed path P in the Rauzy graph r n (w) is said to 
be a simple path of order n if it begins with a special factor v and ends with a special factor v' and 
contains no other special factors, i.e., P is a directed path of the form vv' or vz\ ■ ■ ■ z^v' where each 
Zi is a non-special factor of length n. A special factor v G S n (w) is called a trivial simple path of 
order n. 

In what follows, we use the following terminology for paths. Hereafter, "path" should be taken 
to mean "directed path". 

Definition 3.1. Suppose w is an infinite word and let P = v ■ ■ ■ v' be a path in T n (w). 

• The first vertex v (resp. last vertex v') is called the initial vertex (resp. terminal vertex) of P. 

• A vertex of P that is neither an initial vertex nor a terminal vertex of P is called an interior 
vertex of P. 

• P is said to be a non-trivial path if it consists of at least two distinct vertices. 

• The reversal P of the path P is the path obtained from P be reversing all edge labels (and 
arrows) and all labels of vertices. 

• We say that P is palindromic (or that P is invariant under reversal) if P = P. 

Note. Given a path P in r n (w), the reversal of P does not necessarily exist in T n (w). 

Suppose P = W1W2 ■ • • Wk is a non-trivial path in T n (w), and for each i with 1 < i < k, let cij 
and bi denote the respective first and last letters of iOj. Then, by the definition of r n (w), we have 
wib2 ■ ■ - bk = a\ ■ ■ ■ at-iWk- We call this word the label of the path P, denoted by ip. Note that 
the z-th shift of ip := w\b2 ■ ■ ■ b^ begins with u>i+i for all i with 1 < i < k — 1. 

For our purposes, it is convenient to consider the reduced Rauzy graph of order n, denoted 
by r^(w), which is the directed graph obtained from r ra (w) by replacing each simple path P = 
W1W2 ■ • • w/j-iWfc with a directed edge w\ — > labelled by ip. Thus the set of vertices of T' n (w) is 
S n (w). For example, consider the (rich) Fibonacci word: 



which is generated by the Fibonacci morphism ip : a i— > ab, b i— > a. The reduced Rauzy graph T' 2 (f) 
consists of the two (special) vertices: ab, ba and three directed edges: ab — > ba, ba — > ba, ba — > ab 
with respective labels: aba, baab, bab. 

Lemma 3.2. Let w be a rich infinite word and suppose P = w±W2 ■ ■ ■ Wk is a non-trivial path in 
r„(w) with k>2. Then the label ip = ui\b2 ■ ■ ■ b^ is a rich word. 




(deg + (t;) - 1) for all n G N. 



(3.2) 



v eS„(w) 



f = abaababaabaababaababaabaababaabaababaababaabaababaaba ■ ■ ■ 
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Proof. We proceed by induction on the number of vertices k in P. The lemma is clearly true for 
k = 2 since lp = w\b2 is a factor of w of length n + 1. Now suppose > 3 and assume that the 
label of any path consisting of k — 1 vertices is rich. Consider any path consisting of k vertices, 
namely P = w\W2 ■ ■ ■ Wk, and suppose by way of contradiction that its label lp = w\b2 ■ ■ ■ bk is 
not rich. Then the longest palindromic prefix p of lp occurs more than once in lp. Hence there 
exists a complete return r to p which is a prefix of lp. It follows that r = lp, otherwise r would 
be a factor of the prefix u := w\b2 ■ ■ ■ b^_\ of lp, and hence a palindrome since u is rich by the 
induction hypothesis. But this contradicts the maximality of the palindromic prefix p. So lp is 
a non-palindromic complete return to p. Let q be the longest palindromic prefix of u (which is 
unioccurrent in u by richness). If \p\ > \q\, then q is a proper prefix of p, and hence q occurs 
more than twice in u, a contradiction. On the other hand, if \p\ < \q\, then p is a prefix of q, and 
hence p is an interior factor of lp (occurring as a suffix of q), a contradiction. Thus lp is rich, as 
required. □ 

The proof of Theorem 1 1 . 1 1 relies upon the following extensions of Propositions 12.3^12.41 to paths. 

Lemma 3.3. (Analogue of Proposition 12.31 ) Suppose w is a rich infinite word and let v be any 
factor of w of length n. If P = v ■ ■ ■ v is a path from v to v in T n (w) that does not contain v or v 
as an interior vertex, then P is palindromic. This property also holds for paths in T^(w). 

Proof. We first observe that if P consists of a single vertex, then P = v = v, and hence P is 
palindromic. Now suppose P is a non-trivial path. If P = vv, then P is clearly palindromic. 
So suppose P = vz\ ■ ■ ■ ZkV where the z% are factors of w of length n. By definition, the label 
lp = vbi ■ ■ ■ bkbk+i begins with v and ends with v and contains neither v nor v as an interior factor 
(otherwise P would contain v or v as an interior vertex, which is not possible). Thus, as lp is rich 
(by Lemma l3,2h . it follows that lp is a palindrome by Proposition 12. 3| whence P must be invariant 
under reversal too. It is easy to see that this property is also true for paths in the reduced Rauzy 
graph r^(w). □ 

Lemma 3.4. (Analogue of Proposition 12.41 ) Suppose w is a rich infinite word and let v be any 
non-palindromic factor ofw of length n. If P = v ■ ■ ■ v is a non-trivial path in r n (w) that does not 
contain v as an interior vertex, then P passes through v exactly once. This property also holds for 
paths in r^(w). 

Note. Of particular usefulness is the fact that any path from v to v must pass through v. 

Proof. Let us write P = vz\ ■ ■ ■ z^v where the Zi are factors of w of length n. By definition, the 
label lp = vbi ■ ■ ■ b^bk+i contains exactly two occurrences of v, one as a prefix and one as a suffix 
(otherwise, if lp contained v as an interior factor, then v would be an interior vertex of P, which 
is not possible). Thus, as lp is rich (by Lemma 13 . 2(> . it follows that v is a unioccurrent (interior) 
factor of lp by Proposition I2.4( whence P passes through v exactly once. It is easy to see that this 
property is also true for paths in the reduced Rauzy graph T^(w). □ 

3.1 (I) implies (II) 

Suppose w is an infinite word with .F(w) closed under reversal and satisfying property (I). Then 
w is recurrent by Proposition 12.21 (i.e., w is a recurrent rich infinite word). Moreover, recurrence 
implies that for all n, the Rauzy graph r n (w) is strongly connected, i.e., there exists a directed 
path from any vertex v to every other vertex v' in r n (w). 
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Fix n 6 N and let us now consider the super reduced Rauzy graph of order n, denoted by 
r"(w), whose set of vertices consists of all [v] := {v,v} where v is any special factor of length n. 
Any two distinct vertices [v], [w] (with v {w,w}) are joined by an undirected edge with label 
[£p] := {ip,lp} if P or P is a simple path beginning with v or v and ending with w or w. For 
example, in the case of the Fibonacci word, T^f) consists of only one vertex: [ab]. In general, 
the super reduced Rauzy graph consists of more than one vertex and may contain multiple edges 
between vertices. 

Suppose r"(w) consists of s vertices; namely [vi\, i = 1, . . ., s. Since r n (w) is strongly connected 
(by recurrence), r"(w) is connected; thus it contains at least s — 1 edges. 

Now, from Lemma 13. 3} we know that if v is a special factor, any simple path from v to v is 
palindromic (i.e., invariant under reversal). Moreover, by closure under reversal, if there exists a 
simple path P from a special factor v to a special factor w, with v ^ {w,w}, then there is also 
a simple path from w to v (namely, the reversal of the path P). Neither of these simple paths is 
palindromic. 

We thus deduce that there exist at least 2(s — 1) non-trivial simple paths in the Rauzy graph 
r n (w) that are non-palindromic (i.e., not invariant under reversal). In fact, we will show that there 
are exactly 2(s— 1) non-trivial simple paths of order n that are non-palindromic. Indeed, if this true 
then, as each palindromic factor of length n or n + 1 is a central factor of a (unique) palindromic 
simple path of order n, we have: 

V{n)+V(n + l) = deg + (v)-2(s-l)+p (3.3) 

ueSVi(w) 

where, on the right hand side, the first summand is the total number of non-trivial simple paths, 
the second summand is the number of non-trivial simple paths that are non-palindromic, and p is 
the number of special palindromes of length n (i.e., the number of trivial simple paths of order n 
that are palindromic). By observing that the number of special factors of length n is 2s — p, we 
can simplify equation (|3.3p to obtain the required equality (II) as follows: 

V(n)+V(n + 1) = deg + (t>) - (2s - p) + 2 

= (deg + W-l) + 2 

= C(n + 1) - C(n) + 2 fbv Of ). 

We observe, in particular, that any infinite word w with F(w) closed under reversal satisfies 
equality (II) if and only if any simple path between a special factor and its reversal is palindromic, 
and for each n, there are exactly 2(s— 1) non-trivial simple paths of order n that are non-palindromic. 
The latter condition says that, for all n, the super reduced Rauzy graph T"(w) contains exactly 
s — 1 edges (with each edge corresponding to a simple path and its reversal), and hence r"(w) is 
a tree as it contains s vertices, s — 1 edges, and must be connected by the recurrence of w (which 
follows from Proposition 12. 2p . More formally: 

Proposition 3.5. An infinite word w with F(w) closed under reversal satisfies equality (II) if and 
only if the following conditions hold: 

1) any simple path between a special factor and its reversal is palindromic; 

2) the super reduced Rauzy graph r"(w) is a tree for all n. 
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Proof. Suppose w is an infinite word with P(w) closed under reversal. Then w is recurrent by 
Proposition 12.21 We have already shown that conditions 1) and 2) imply that w satisfies equal- 
ity (II). Conversely, if at least one of conditions 1) and 2) does not hold, then V{n) + V{n + 1) < 
C(n + 1) — C{n) + 2 (by the arguments preceding this proposition), i.e., w does not satisfies equal- 
ity (II). □ 

To complete the proof of "(I) => (II)", it remains to show that any recurrent rich infinite word 
w satisfies condition 2) of Proposition 13.51 since we have already shown that condition 1) holds 
for any such w (using Lemma |3.3|) . The proof of the fact that w satisfies condition 2) uses the 
following two lemmas (Lemmas l3.6fj3"7T)) . 

Notation. Given two distinct special factors v, w of the same length n, we write v -f* w if there 
does not exist a directed edge from v to w in the reduced Rauzy graph T' n (w) (i.e., if there does 
not exist a simple path from v to w). 

Lemma 3.6. Suppose w is a recurrent rich infinite word and let v, w be two distinct special factors 
of w of the same length with v {w,w}. If there exists a simple path P from v to w, then P is 
unique and there also exists a unique simple path from w to v (namely, the reversal of P). Moreover: 

i) v -/-> w, and hence w -/-> v (unless w is a palindrome); 

ii) w -f+ v, and hence v ■/* w (unless v is a palindrome); 

Hi) w -f+ v, and hence v w (unless v and w are both palindromes). 

Proof. By closure under reversal (Remark 12 . X [) . if there exists a simple path P from v to w, then 
the reversal of P is a simple path from w to v in the Rauzy graph of order = \w\ = n. To prove 
the uniqueness of P, let us suppose there exist two different simple paths Pi , P2 from v to w in the 
Rauzy graph T n (w). Then 

Pi = vu\ ■ ■ ■ UkW and P2 = vz\ ■ ■ ■ zgw for some k, I G N, 

where u%, . . . , Uk, z±, . . . , zg are non-special factors of w of length n and ui 7^ Z{ for some i. Note 
that either Pi or P2 (not both) may be of the form vw. 

To keep the rest of the proof as simple as possible, we assume hereafter that neither v nor w 
is a palindrome; the arguments are similar, and in fact easier, in the cases when either v or w (or 
both) is a palindrome. 

Consider a path Q of minimal length beginning with Pi and ending with Pi (in the Rauzy graph 
l'n(w)): 

Q = Pi ■ ■ ■ P<2 = VU\ ■ ■ ■ Uk W ■■ ■ V y Z\ ■ ■ ■ Z£W. 

Qi 

First we observe that Q contains v since any path from v to itself must pass through v, by 
Lemma 13.41 Moreover, the left-most v in Q must occur in the subpath Qi (since v is not equal to 
any of the non-special factors Ui, Zj and v 7^ w). Therefore 

Q = vu\ ■ ■ ■ UkW ■••{;••• vz\ ■ ■ ■ zgw 



where the subpath Qi ends with the left-most v in the path Q. By Lemma 13.41 Qi is a path from 
v to v that does not contain v or v as an interior vertex. Thus, by Lemma 13.31 Q2 is palindromic, 
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and hence Q2 ends with the reversal of the path P± since it begins with Pi . More explicitly: 

Qa 

< * ^ 

Q = TOl • • • UkW ■ ■ ■ WUk ■ ■ ■ U\V ■ ■ ■ VZl ■ ■ ■ Z£W . 

Pi ft P2 

We distinguish two cases. 

Case 1: If the subpath Q3 contains u; as a terminal vertex only, then w is not an interior vertex of 
Q3 by Lemma 13.41 and hence Q3 is palindromic by Lemma 13.31 It follows that k = I and Zi = Ui 
for all i = 1, . . . , k. Thus Pi = P2; a contradiction. 

Case 2: If the subpath Q3 contains w as an interior vertex, then C/3 first passes through w after 
taking the path Pi (at the beginning) and before taking the path P2 (at the end). Hence, by 
Lemma 13. 3\ Q3 begins with a palindromic path from w to w that begins with Pi and hence 
ends with Pi. But then Q passes through the path Pi at least twice before taking the path P2, 
contradicting the fact that Q is a path of minimal length beginning with Pi and ending with P2. 

Both cases lead to a contradiction; thus the simple path P from v to w is unique (and its 
reversal P is the unique simple path from w to v). It remains to show that conditions hold. 
As ii) is symmetric to i), we prove only that i) and Hi) are satisfied. By what precedes, it suffices 
to consider paths in the reduced Rauzy graph T^(w). 

i): Arguing by contradiction, let us suppose that there exists a (unique) simple path from v to 
w, i.e., there exists a directed edge from v to w in the reduced Rauzy graph r^(w). Then (from 
above) we know that there also exists a directed edge from w to v. Consider a shortest path Q in 
the reduced Rauzy graph r^(w) beginning with vw and ending with vw. By Lemma 13.44 any path 
from v to itself passes through v, so we may write 

Q = vw ■ ■ - v ■ ■ ■ vw, 

Qi 

where the subpath Q\ ends with the left-most v in the path Q. By Lemmas I3.3H3.44 the path 
Qi = vw---v is palindromic, and hence it ends with wv. So we have Q = vw ■ ■ ■ wv ■ ■ ■ vw; 
moreover, by Lemma 13.41 w must occur between the last two iCs shown here. In particular, 

Q = vw ■ ■ ■ wv • • ■ w ■ ■ ■ vw 

Q2 

where the subpath Q2 contains ui as a terminal vertex only. Thus, by Lemmas I3.3H3.41 the path 
Q2 = wv ■ ■ ■ w is palindromic, and hence it ends with vw. But then Q ends with a shorter path of 
the form vw ■ ■ ■ vw, contradicting the fact that Q is a path of minimal length beginning with vw 
and ending with vw. 

Hi): Again, the proof proceeds by contradiction. Suppose there exists a (unique) simple path from 
w to v. Consider a shortest path Z in the reduced Rauzy graph T^(w) beginning with wv and 
ending with vw. By Lemma |3.4| the path Z must pass through w; thus 

Z = wv • • • w ■ ■ ■ vw. 
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where the subpath Z\ ends with the left-most w in the path Z. Now it follows from Lemmas 13.31 13, 4| 
that the subpath Z% is palindromic, and hence Z\ must end with vw. So we may write 

Z = wv ■ • - vw -j ■ v y w. 

If the subpath Z2 contains v as a terminal vertex only, then neither v nor v is an interior vertex of 
Z2 by Lemma 13.41 Thus Z2 is palindromic by Lemma 13.31 and hence Z2 ends with wv. But then 
the path Z ends with the path wvw, which is impossible by Lemma 13.41 Thus, the subpath Z2 
must pass through v at an earlier point, and hence we have Z2 = vw ■ ■ ■ v ■ ■ ■ v. In particular, the 
path Z2 begins with a palindromic subpath of the form vw ■ ■ ■ wv, by Lemma 13.31 But then the 
path Z ends with a shorter path from wv to vw, contradicting the minimality of Z. □ 

Notation. For a finite word v, let v e represent either v or v and set v~ e := v e . 

Lemma 3.7. Let w be a recurrent rich infinite word. For fixed n £ N+, suppose the super reduced 
Rauzy graph T"(w) contains at least three distinct vertices: [v\], [V2], [v s ], s > 3. Then, for 
each k with 3 < k < s, the reduced Rauzy graph r^(w) contains a path from v\ to v k k of the form: 

viv^ ■ ■ ■ v 2 v e 3 3 ■ ■ ■ v k -2v e k k Si ■ ■ ■ Vk-\v e k k , 

where for all i = 2, . . . , k — 1, the subpath v^ 1 ■ ■ ■ Vi (which may consist of only the single vertex v^) 
does not contain Vj, Vj for all j with 1 < j < k, j ^ i. 

Proof. We use induction on k and employ similar reasoning to the proof of Lemma 13.61 

First consider the case k = 3. Recurrence implies that r^(w) is connected, so we may assume 
without loss of generality that r^(w) contains a directed edge from v\ to v^ 2 , a directed edge from 
t>2 to V3 3 , and a path from v^ to V2- That is, r^(w) contains a path beginning with v\v^ and 
ending with V2V < 3 . Consider such a path of minimal length: 

Q = viv^ 2 ■ ■ ■ 

To prove the claim for k = 3, we show that none of the special factors v\, v\, U3, v% are interior 
vertices of Q. If v^ 2 = V2, then Q = viV2V^ 3 (by minimality) and we are done. So let us assume 
that v^ 2 = V2 7^ V2- 

Observe that if v\ is an interior vertex of Q, then v\ must be an interior vertex of Q since any 
path from v\ to itself must contain v\, by Lemma 13.41 Similarly, if v^ 3 is an interior vertex of Q, 
then v^ 63 is an interior vertex of Q. Therefore it suffices to show that v\ and v^ 63 are not interior 
vertices of Q. We prove this fact only for v\ as the proof is similar for v^ 63 . 

Arguing by contradiction, suppose v\ is an interior vertex of Q. Then Q begins with a palin- 
dromic path from V\v<i to v\ (by Lemmas I3.3H3.4]) . and this palindromic path clearly ends with 
V2V\. Hence 

Q = V1V2 ■ ■ - V2V1 ■ ■ ■ V2 y Vl 3 

Q' 

where the subpath Q' begins with a palindromic path from V2V1 to V2 (by Lemmas I3.3H3.4| ). and 
this palindromic path clearly ends with V\V2- But then the path Q ends with a shorter path from 
V1V2 to , contradicting the minimality of Q. Thus the lemma holds for k = 3. 
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Now suppose 4 < k < s and assume the claim holds for k — 1. Since r^(w) is connected, 
it contains a path beginning with v \v e 2 2 ■ ■ ■ v 2 v^ 3 ■ ■ ■ Vk- 2 v k k ~^ and ending with Vk-iv e k k (where the 
former path satisfies the conditions of the lemma). Consider such a path of minimal length: 

Z = V X v£ ■ ■ ■ V 2 vf ■ ■ ■ V k -2 vt k Sl ' • • V k-1 v k k ( 3 - 4 ) 

V v '» „ ' 

Z\ z 2 

where for all i = 2, . . . , k — 2, the subpath v^ 1 ■ ■ ■ V{ (which may consist of only the single vertex v^') 
does not contain Vj, Vj for all j with 1 < j < k — 1, j ^ i. To prove the induction step, we show 
that the path Z satisfies the following two conditions: 

i) the subpath Z\ contains neither v\. nor Vk', 

ii) the subpath Z 2 = v£~^ ■ ■ ■ v^-i does not contain Vj, Vj for all j with 1 < j < k, j ^ k — 1. 

First suppose that condition i) is not satisfied, i.e., Z\ contains v k or v^- Without loss of 
generality we assume that is the right-most of the vertices v^, Vk appearing in Z\. 

Case 1: Suppose vjr fc = Vk ^ Vk- Then Z ends with a path from Vk to itself, which must pass 
through Vk by Lemma |3.4( moreover, must be an interior vertex of Z 2 (by the choice of v^). 
Thus, by Lemmas l3.3H3.4l Z 2 Vk (and hence Z) ends with a palindromic path from to Vk~\Vk- 
Hence Z 2 contains VkVk-i, and we have: 

Z 2 v k = v e k h Si ■ ■ ■ v k v k -i ■ ■ ■ Vk-iVk 

" v ' 

z 3 

where the subpath Z% ends with a palindromic path from Vk~\ to VkVk~\ (by Lemmas l3.3H3.4p : 
thus Z3 contains Vk-iVk- But then Z begins with a shorter path from Z\ to Vk-iv € k k , contradicting 
the minimality of Z. 

Case 2: Suppose v € k k = Vk- Then the path Z (= Z\Z 2 Vk) ends with a path of the form: 

^4 = v k Z 2 v k - 

no v k ,v k 

If Vk or Vk is an interior vertex of Z 2 , then we reach a contradiction using the same arguments as in 
Case 1. On the other hand, if neither Vk nor Vk is an interior vertex of Z 2 , then Z4 is palindromic 
by Lemma [3731 So the path Z4 begins with v k Vk-i since it ends with Vk-iVk- But then v k -i is an 
interior vertex of Z\, a contradiction. 

Thus the path Z satisfies condition i). In proving this fact, we have also shown that v k , v k are 
not interior vertices of Z 2 . It remains to show that the subpath Z 2 does not contain v j , Vj for all j 
with 1 < j < k — 2 (and hence Z satisfies condition ii)). We prove only that Z 2 does not contain 
v± or v\ since the proof is similar when considering other Vj, Vj. 

Suppose on the contrary that Z 2 contains v\ or v\. Then, by Lemmas l3.3H3.4l Z begins with 
a palindromic path from v\ to v\, and this palindromic path begins with Y = Ziv k k ~^ (and hence 
ends with Y) by the conditions on Z under the induction hypothesis. More explicitly, we have: 

palindromic 



Z = viv e 2 2 ■ ■ ■ Vk- 2 v e k _l ■ ■ ■ v k *\ x Vk-2 ■ • -v 2 e2 vi ■ ■ ■ Vk-iv'k 

Y 
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Hence, as v k -i and Vk-i are n °t interior vertices of Y (by the induction hypothesis), the subpath 
Y Z^ begins with a palindromic path from v7^H to and this palindromic path begins with Y 

(and hence ends with Y), by Lemmas I3.3H3.4I But then Z ends with a shorter path from Y to 
Vk—l v V°i contradicting the minimality of Z. 

We conclude that the subpath Z2 = v k k ~^ ' ' ' v fc-i does not contain Vj, Vj for all j with 1 < j < k, 
j 7^ i (i.e., the path Z satisfies condition ii)), and the proof is thus complete. □ 

Lemma 3.8. Suppose w is a recurrent rich infinite word. Then the super reduced Rauzy graph 
r" (w) is a tree for all n £ N+ . 

Proof. First recall that for all n, r"(w) is connected (by the recurrence property of w). Moreover, 
Lemma 13.61 tells us that if two distinct vertices in r"(w) are joined by an edge, then this edge is 
unique (and corresponds to a simple path and its reversal). It remains to show that r"(w) does 
not contain any cycle (i.e., does not contain a chain linking a vertex with itself). 

Suppose on the contrary that T"(w) contains a cycle for some n. Then r"(w) must contain at 
least three distinct vertices: [vi], [112], • • • , [v s ], s > 3, and a cycle of the following form: 

[v\] — [^2] — • • • — [vk] — [vi] for some k with 3 < k < s. (3-5) 

We thus deduce from Lemma [3771 that the reduced Rauzy graph r^(w) contains a path from v\ to 
vl 1 of the form: 

P = VlV e 2 2 ■ ■ ■ V2V f ■ ■ ■ v k _ 2 v e k k Si ■ ■ ■ Vk-iv e k k ■ ■ ■ VkVl 1 , 

where for all i = 2, . . . , k, the subpath v^ 1 ■ ■ ■ Vi (which may consist of only the single vertex v^ 1 ) 
does not contain Vj, Vj for all j with 1 < j < k, j 7^ i. (Note that P corresponds to the cycle given 
in (pD)}.) 

First suppose that V\ is a palindrome. In this neither v\ nor v\ is an interior vertex of 

P, it must be a palindromic path by Lemma I3.3L But then v k = v^ 62 , a contradiction (as k > 3). 

Now suppose that v\ is not a palindrome. If i^ 1 = v\, then we deduce (as above, using 
Lemma l3.3p that the path P must be palindromic, yielding a contradiction. On the other hand, if 
v\ = Vi, then, by Lemma l3~4"l the path P must pass through v\, a contradiction. 

Thus r"(w) is a tree. □ 

This concludes our proof of the "(I) =>■ (II)" part of Theorem ll.il 
3.2 (II) implies (I) 

Conversely, suppose w is an infinite word with F(w) closed under reversal and satisfying equal- 
ity (II). Then w satisfies conditions 1) and 2) of Proposition 13.51 

Now, arguing by contradiction, suppose w does not satisfy property (I) (i.e., w is not rich). 
Then there exists a palindromic factor p that has a non-palindromic complete return u in w; in 
particular, we have u = pqavbqp for some words q, v (possibly empty) and letters a, b, with a 7^ b. 
So the words pqa, bqp and their reversals aqp, pqb are factors of w. Thus pq (resp. qp) is a right- 
special (resp. left-special) factor of w. Hence, if u does not contain any other special factors, then 
u forms the label of a non-palindromic simple path beginning with pq and ending with qp. But 
this contradicts condition 1) of Proposition 13.51 Therefore u must contain other special factors of 
length n := \pq\, besides pq and qp. In particular, u begins with the label of a simple path of order 
n beginning with pq and ending with another special factor si of length n. Similarly, u ends with 
the label of a simple path of order n beginning with a special factor s 2 of length n and ending with 
qp. Moreover, since u is a complete return to p, neither s\ nor s 2 is equal to pq or qp (otherwise 
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p occurs as an interior factor of u). Thus, in the super reduced Rauzy graph r"(w), there is an 
edge between the vertex [pq] and each of the vertices [si] and [S2]. In particular, there exists a 
path of the form: [s\] — \pq] — [52]- Furthermore, as u contains a factor that begins with s\ and ends 
with S2 and contains no occurrence of pq or qp, there also exists a chain (or possibly just an edge) 
linking [s±] and [$2] that does not contain the vertex \pq\. Thus, if {si,si} 7^ {s2,S2}, then we see 
that r^(w) contains a cycle, contradicting condition 2) of Proposition 13.51 On the other hand, if 
{si, s\} = {s2, S2}, then there are at least two edges joining the vertices [s\] and \pq]. Indeed, there 
exists a simple path P\ from pq to s\ and there also exists a simple path P2 either from s\ to qp or 
from si to qp. By closure under reversal, the reversals Pi, P2 of the respective simple paths Pi, P2 
also exist. Moreover, none of these four simple paths coincide. Certainly, P\ ^ P2, Pi 7^ Pi, and 
P2 7^ Pi as neither si nor §1 is equal to pq or qp, and Pi 7^ P2 as the second vertex in Pi ends with 
the letter a, whereas the second vertex in the path P2 ends with the letter b 7^ a. So T"(w) is not 
a tree, contradicting condition 2) of Proposition 13.51 This concludes our proof of Theorem ll.il □ 

4 A few consequences and remarks 

From Theorem 1 we easily deduce that property (I) is equivalent to equality (II) for any uniformly 
recurrent infinite word. Indeed, equality (II) implies the existence of arbitrarily long palindromes 
since V{n) +V(n + 1) > 2 for all n, so together with uniform recurrence one can readily show that 
factors are closed under reversal; hence property (I) holds by Theorem 11.11 Conversely, richness 
(property (I)) together with uniform recurrence implies closure under reversal by Remark l2.lt and 
hence equality (II) holds. 

Question: In the statement of Theorem can the hypothesis of factors being closed under 
reversal be replaced by the weaker hypothesis of recurrence ? 

As above, it follows directly from Theorem 11.11 and Remark 12.11 that for any recurrent infinite 
word w, if w satisfies property (I) (i.e., if w is rich, and hence has factors closed under reversal), 
then equality (II) holds. However, to prove the converse using our methods, one would need to 
know that any recurrent infinite word satisfying equality (II) has factors closed under reversal. We 
could not find a proof of this claim nor could we find a counter-example. Let us point out that 
whilst uniform recurrence and the existence of arbitrarily long palindromes imply closure under 
reversal, this is not true in the case of recurrence only. For instance, consider the following infinite 
word: 

s = bca bca bca bca bca bca bca bca be- ■ ■ , 
which is the limit as n goes to infinity of the sequence (s n )n>i of finite words defined by: 

s\ = be and s n = s n -ia n s n ^i for n > 1. 

This infinite word is clearly recurrent (but not uniformly recurrent) and contains arbitrarily long 
palindromes, but its set of factors is not closed under reversal. (Note that s is not rich and does 
not satisfy equality (II).) If one could show that recurrence together with equality (II) implies 
arbitrarily long palindromic prefixes, this would be enough to prove that factors are closed under 
reversal. 

In the context of finite words w, the hypothesis of factors being closed under reversal can be 
replaced by the requirement that w is a palindrome. Indeed, all we really need is the super reduced 
Rauzy graph to be connected, which is true for palindromes. 
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Theorem 4.1. For any palindrome w, the following properties are equivalent: 
i) w contains \w\ + 1 distinct palindromes; 
ii) all complete returns to palindromes in w are palindromes; 

Hi) V(i) + V{% + 1) = C(i + 1) - C(i) + 2 for all i with < i < \w\. □ 
We now prove two easy consequences of Theorem 11.11 
Corollary 4.2. Suppose w is a recurrent rich infinite word. Then the following properties hold, 
i) w is (purely) periodic if and only ifV(n) + V{n + 1) = 2 for some n. 

ii) (V(n)) n >i is eventually periodic with period 2 if and only if there exist non-negative integers 
K , L, N such that C{n) = Kn + L for all n > N . 

Proof. Suppose w is a recurrent rich infinite word. Then V{n) + V(n + 1) = C(n + 1) — C(n) + 2 
for all n, by Theorem 1 1 . 1 1 and Remark 12.11 

i) : If V(n) + V(n + 1) = 2 for some n, then C{n + 1) = C(n), and hence w is eventually periodic; 
in particular, w must be (purely) periodic as it is recurrent. Conversely, if w is periodic, then 
C(n + 1) = C(n) for some n, and hence V(n) + V{n + 1) = 2. 

ii) : The condition on C(n) implies that for all n > N, C(n + 1) — C(n) = K, and hence V(n) + 
V(n+1) = K + 2 = V(n + l) + V(n + 2). Thus V{n) = V{n + 2) for all n > N. Conversely, suppose 
(V(n)) n >i is eventually periodic with period 2. Then there exists a non-negative integer N such 
that V(n) = Tin + 2) for all n > N. Hence, for all n > N, V(n) + V{n + 1) = C(n + 1) - C(n) + 2 = 
V(n + 1) + V(n + 2) = M > 2. Therefore C(n + 1) - C(n) = M - 2 for all n > N. □ 

Remark 4.3. Item ii) of the above corollary can be compared with a result of J. Cassaigne [8], 
who proved that if C(n) has linear growth, then C(n + 1) — C(n) is bounded. 

Remark 4.4. In [5], Balazi et al. remarked: "According to our knowledge, all known ex- 
amples of infinite words which satisfy the equality V(n) + V(n + 1) = C(n + 1) — C(n) + 2 
for all n € N have sublinear factor complexity." Actually, there do exist recurrent rich in- 
finite words with non-sublinear complexity. For instance, the following example from [19J: 
abab 2 abab^abab 2 abab A abab 2 abab^abab 2 abaft ■ ■ ■ (which is the fixed point of the morphism: a i— > abab, 
6 i — > 6) is a recurrent rich infinite word and its complexity C{n) grows quadratically with n. Another 
example that was indicated to us by J. Cassaigne is the fixed point of a i— > aab, b^b: 

aabaabbaabaabbbaabaabbaabaabbbbaabaabbaabaabbbaabaabbaabaabbbbb ■ ■ ■ . 

It is a recurrent rich infinite word and its complexity is equivalent to n 2 /2. More precisely, V(n) + 
V(n + 1) - 2 = C(n + 1) - C(n) = n + 1 - J){/c> | 2 k + k - 2 < n}. 

In [TO], X. Droubay et al. showed that the family of episturmian words (e.g., see [TO 1 |2T | [TO]), 
which includes the well-known Sturmian words, comprises a special class of uniformly recurrent 
rich infinite words. Specifically, they proved that if an infinite word w is episturmian, then any 
factor u of w contains exactly \u\ + 1 distinct palindromic factors (see |1CH Cor. 2]). An alternative 
proof of the richness of episturmian words can be found in the paper [3] where the fourth author, 
together with V. Anne and I. Zorca, proved that for episturmian words, all complete returns to 
palindromes are palindromes. (A shorter proof of this fact is also given in [7.].) More recently, 
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P. Balazi et al. [5] showed that all strict episturmian words (i.e., Arnoux-Rauzy sequences [H I23j) 
satisfy V(n) + V(n + 1) = C(n + 1) — C(n) + 2 for all n. This fact, together with Theorem II .14 
provides yet another proof that all episturmian words are rich (since any factor of an episturmian 
word is a factor of some strict episturmian word). 

Sturmian words are exactly the aperiodic episturmian words over a 2- letter alphabet. They have 
complexity n + 1 for each n and are characterized by their palindromic complexity: any Sturmian 
word has V{n) = 1 whenever n is even and V(n) = 2 whenever n is odd (see From these 

observations, one can readily check that Sturmian words satisfy equality (II) (and hence they are 
rich). 

We can now say even more: the set of factors of all Sturmian words satisfies equality (II). To 
show this, we first recall that F. Mignosi [22J proved that, for any n > 0, the number c(n) of finite 
Sturmian words of length n is given by 

n 

c(n) = 1 + J^(n + l-i)0(»), 
i=i 

where (ft is Euler's totient function. More recently, in [9], the second author together with A. de Luca 
proved that for any n > 0, the number p(n) of Sturmian palindromes of length n is given by 

\n/2\-l 

p[n) = 1 + <j){n — 2i). 

i=0 

Equivalently, for any n > 0, 

n n 

p(2n) = 1 + ^2<j)(2i) and p(2n + 1) = 1 + ^ cj)(2i + 1). 

i=l i=0 

Thus, for all n > 0, 

n 2n+l 

p(2n) + p(2n + 1) = 2 + {<f>{2i) + (/>(2i + 1)} + 2 = ^ <f>(i) + 2, 

i=i i=i 

and 

2n+l 2n 



c(2n + 1) - c(2n) + 2 = ^ (2n + 2 - i)<f>(i) - J^(2n + 1 - i)<p{i) + 2 

i=i i=i 

2n 

= 0(2n + l) + ^0(i) + 2 
i=i 

2n+l 

= V 0(i) +2 =p(2n) +p(2n + 1). 



i=i 

From this point of view, it would be interesting to count for instance the number of all binary 
rich words of length n for each n. 
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